Mirex 2012 Submission Audio Classification Using Sparse Feature Learning

نویسندگان

Juhan Nam

Jorge Herrera

چکیده

We present a training/test framework for automatic audio annotation and ranking using learned feature representations. Commonly used audio features in audio classification, such as MFCC and chroma, have been developed based on acoustic knowledge. As an alternative, there is increasing interest in learning features from data using unsupervised learning algorithms. In this work, we apply sparse Restricted Boltzmann Machine to audio data, particularly focusing on learning high-dimensional sparse feature representation. Our evaluation results on two music genre datasets show that the learned feature representations achieve high accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mirex 2011: Automatic Audio Tag Classification via Sparse Coding

This extended abstract details our submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2011 for the audio tag classification task. First of all, we extract a fixed-length feature vector (composed of some timbral as well as modulation spectrum features) from each song clip. Then, by using l-reconstruction to represent each test song clip as a linear combination of all train...

متن کامل

Mirex 2012 Submission Audio Classification Using High-dimensional Representations Learned on Standard Audio Features

We present a training/test framework for audio classification using learned feature representations. In contentbased music information retrieval tasks, standard audio features such as MFCC and chroma are typically used to represent the music content. As an alternative, there is increasing interest in learning feature representations from data using unsupervised learning algorithms. In the previ...

متن کامل

Mirex 2011: Music Geren Classification via Sparse Representation

This extended abstract details our submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2011 for the audio training\test task. First of all, we extract a fixed-length feature vector (composed of some timbral as well as modulation spectrum features) from each training clip. Then, by representing a fixed-length feature vector (extracted from a test clip) as a linear combinati...

متن کامل

MIREX 2010 Audio Onset Detection

This paper presents an approach for the Audio Onset Detection task [1], which is submitted to MIREX 2010. In MIREX 2009, we presented our approach that utilizes information on the general characteristics of the notes for onset categorization, as well as integrates energy-based and pitch-based detection results. In MIREX 2010, we extend our submission to MIREX 2009 by parameters fine-tuning and ...

متن کامل

Mirex 2012: Mood Classification Tasks Submission

In this work, three audio frameworks – Marsyas, MIR Toolbox and PsySound3, were used to extract audio features from the audio samples. These features are then used to train several classification models, resulting in the different versions submitted to MIREX 2012 mood classification task.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Mirex 2012 Submission Audio Classification Using Sparse Feature Learning

نویسندگان

چکیده

منابع مشابه

Mirex 2011: Automatic Audio Tag Classification via Sparse Coding

Mirex 2012 Submission Audio Classification Using High-dimensional Representations Learned on Standard Audio Features

Mirex 2011: Music Geren Classification via Sparse Representation

MIREX 2010 Audio Onset Detection

Mirex 2012: Mood Classification Tasks Submission

عنوان ژورنال:

اشتراک گذاری